Improving Topic Evaluation Using Conceptual Knowledge

Authors

  • Claudiu Cristian Musat
  • Julien Velcin
  • Stefan Trausan-Matu
  • Marian-Andrei Rizoiu
Abstract

The growing number of statistical topic models has led to the need to better evaluate their output. Traditional evaluation methods estimate the model's fit to unseen data. It has recently been shown that human judgment can differ greatly from these measures. There is thus a pressing need for methods that better emulate human judgment. In this paper we present a system that computes the conceptual relevance of individual topics from a given model on the basis of information drawn from a given concept hierarchy, in this case WordNet. Conceptual relevance is regarded as the ability to attribute a concept to each topic and, based on that concept, to separate the words related to the topic from the unrelated ones. In multiple experiments we demonstrate the correlation between the automatic evaluation method and the answers received from human evaluators, for various corpora and difficulty levels. By changing the evaluation focus from a statistical one to a conceptual one, we were able to detect which topics are conceptually meaningful and rank them accordingly.
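The idea of attributing a concept to a topic and then separating related from unrelated words can be illustrated with a minimal sketch. This is not the authors' algorithm: the tiny `HYPERNYM` table is a hypothetical stand-in for WordNet, and the scoring rule (number of covered words times concept depth) is an assumption chosen only to make the example concrete.

```python
# Toy hypernym hierarchy (child -> parent); a hypothetical stand-in for WordNet.
HYPERNYM = {
    "animal": "entity",
    "vehicle": "entity",
    "dog": "animal",
    "cat": "animal",
    "horse": "animal",
    "car": "vehicle",
    "truck": "vehicle",
}


def ancestors(word):
    """Return the word followed by all of its hypernyms up to the root."""
    chain = [word]
    while chain[-1] in HYPERNYM:
        chain.append(HYPERNYM[chain[-1]])
    return chain


def depth(concept):
    """Number of hypernym steps from the concept up to the root."""
    return len(ancestors(concept)) - 1


def best_concept(topic_words):
    """Attribute one concept to a topic and split its words around it.

    Assumed scoring: (number of topic words under the concept) * depth,
    which rewards concepts that are both broad enough and specific enough.
    """
    candidates = {c for w in topic_words for c in ancestors(w)}

    def score(concept):
        covered = sum(1 for w in topic_words if concept in ancestors(w))
        return covered * depth(concept)

    # Sort first so that ties are broken deterministically.
    best = max(sorted(candidates), key=score)
    related = [w for w in topic_words if best in ancestors(w)]
    unrelated = [w for w in topic_words if best not in ancestors(w)]
    return best, related, unrelated


concept, related, unrelated = best_concept(["dog", "cat", "horse", "car"])
print(concept, related, unrelated)
```

For the toy topic above, the root `entity` covers every word but has depth zero, so the more specific `animal` wins and `car` is flagged as the unrelated word. A real system would replace the toy table with lookups in an actual concept hierarchy and would need to handle word-sense ambiguity, which this sketch ignores.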


Similar resources

Study Course Structure Personalized Planning Using Concept Maps

The personalized study planning framework consists of four graphs: a graph representing the conceptual structure of the study program; a graph representing the study course; a graph visualizing each topic of the study course using a concept map; and a graph representing learning objects. This paper describes the third graph: a graph for study course topic structure displaying and knowledge as...


Inferential Realization Constraints on Functional Anaphora in the Centering Model

We present an inference-based text understanding methodology for the resolution of functional anaphora in the context of the centering model. A set of heuristic realization constraints is proposed, which incorporate language-independent conceptual criteria (based on the well-formedness and conceptual strength of role chains in a terminological knowledge base) and language-dependent information ...


Methodology of conceptual review in the health system

Background: Conceptual review is a creative research method for generating new knowledge in the context of a vague and complex concept that helps to explain and clarify the concept, its components and its relation to related concepts. This study aimed to explain the methodology of conceptual review in the health system. Methods: Articles related to the conceptual research method were searched ...


TREC-7 Evaluation of Conceptual Interlingua Document Retrieval (CINDOR) in English and French

TextWise LLC. participated in the TREC-7 Cross-Language Retrieval track using the CINDOR system, which utilizes a “conceptual interlingua” representation of documents and queries. The current CINDOR research system uses a conceptual interlingua constructed around the Princeton WordNet, which we are mapping into French and Spanish. The use of an interlingual representation of documents and queri...


Viewing and Querying Topic Maps in terms of RDF

Both Topic Maps and RDF are popular semantic web standards designed for machine processing of web documents. Since these representations were originally created for different purposes, they have conceptual differences in their data models, and therefore have different tools to parse, store, and query them. However, there are more tools to handle RDF data than those existing for Topic Maps. Our ...



Publication date: 2011